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Introduction 

We present, in this paper, a particularly simple lazy machine which runs 
programs written in A-calculus. It was introduced by the present writer 
more than twenty years ago. It has been, since, used and implemented 
by several authors, but remained unpublished. 

In the first section, we give a rather informal, but complete, descrip- 
tion of the machine. In the second part, definitions are formalized, 
which allows us to give a proof of correctness for the execution of A- 
terms. Finally, in the third part, we build an extension for the machine, 
with a control instruction (a kind of call-by-name call/cc) and with 
continuations. 

This machine uses weak head reduction to execute A-calculus, which 
means that the active redex must be at the very beginning of the A- 
term. Thus, computation stops if there is no redex at the head of the 
A-term. In fact, we reduce at once a whole chain \x\ . . . \x n . Therefore, 
execution also stops if there are not enough arguments. 

The first example of a A-calculus machine is P. Landin's celebrated 
SECD-machine [7]. The one presented here is quite different, in partic- 
ular because it uses call-by-name. This needs some explanation, since 
functional programming languages are, most of the time, implemented 
through call-by-value. Here is the reason for this choice : 
Starting in the sixties, a fascinating domain has been growing between 
logic and theoretical computer science, that we can designate as the 
Curry-Howard correspondence. Succinctly, this correspondence permits 
the transformation of a mathematical proof into a program, which is 
written : 

in A-calculus if the proof is intuitionistic et only uses logical axioms ; 

in A-calculus extended with a control instruction, if one uses the law 
of excluded middle [2] and the axioms of Zermelo-Frasnkel set theory [4] , 
which is most often the case. 

Other instructions are necessary if one uses additional axioms, such as 
the Axiom of Choice [5] . 

© 2006 Springer Science+Business Media, Inc. Manufactured in The Netherlands. 
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The programs obtained in this way are indeed very complex and two 
important problems immediately arise : how should we execute them 
and what is their behaviour? Naturally, these questions are not inde- 
pendent, so let us give a more precise formulation : 

(i) How should one execute these programs so as to obtain a meaningful 
behaviour ? 

(ii) Assuming an answer to question (i) , what is the common behaviour 
(if any) of the programs obtained from different proofs of the same 
theorem ? 

It is altogether surprising that there be an answer to question (i) ; it is 
the machine presented below. I believe that is, in itself, a strong reason 
for being interested in it. 

Let us give a very simple but illuminating example, namely the follow- 
ing theorem of Euclid : 

There exists infinitely many prime numbers. 
Let us consider a proof D of this theorem, using the axioms of classical 
analysis, or those of classical set theory ; consider, further, the program 
Pd extracted from this proof. One would like to have the following 
behaviour for Pd : 

wait for an integer n ; 

produce then a prime number p > n. 

That is exactly what happens when the program Pd is executed by 
the present machine. But it's not true anymore if one uses a different 
execution mechanism, for instance call-by-value. In this case one gets, 
in general, an aberrant behaviour and no meaningful output. 

This machine was thus conceived to execute programs obtained from 
mathematical proofs. It is an essential ingredient of the classical real- 
izability theory developed in [4, 5] to extend the Curry-Howard cor- 
respondence to analysis and set theory. Thanks to the remarkable 
properties of weak head reduction, one can thus, inter alia, search 
for the specification associated with a given mathematical theorem, 
meaning the shared behaviour of the programs extracted from the 
various proofs of the theorem under consideration : this is question (ii) 
stated earlier. That problem is a very interesting one, it is also quite 
difficult and has only been solved, up to now, in very few cases, even 
for tautologies (cf. [6]). A further interesting side of this theory is that 
it illuminates, in a new way, the problem of proving programs, so very 
important for applications. 
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1. Description of the machine 

Terms of A-calculus are written with the notation (t)u for application of 
t to u. We shall also write tu if no ambiguity arise ; (. . . {{t)u{)u2 ■ ■ )uk 
will be also denoted by [t)u\ ... or tu\ . . . u^. 

We consider three areas in the memory : the term area where are written 
the A-terms to be performed, the stack and the heap. We denote by kt 
the address of the term t in the term area. 
In the heap, we have objects of the following kinds : 

— environment : a finite sequence (e, £1, . . . , where e is the address 
of an environment (in the heap), and £1, . . . , are closures. There 
is also an empty environment. 

— closure : an ordered pair (Set, e) built with the address of a term 
(in the term area) and the address of an environment. 

The elements of the stack are closures. 

Intuitively, closures are the values which A-calculus variables take. 
Execution of a term 

The term to to be performed is written, in "compiled form" in the term 
area. The "compiled form" of a term is obtained by replacing each 
occurrence of Ax with A and each variable occurrence with an ordered 
pair of integers (y, k) (it is a variant of the de Bruijn notation [1], see 
the definition below). We assume that to is a closed term. Thus, the 
term area contains a sequence of closed terms. 

Nevertheless, terms may contain symbols of constant, which are per- 
formed with some predefined programs. For example : 

— a constant symbol which is the name of another closed term ; the 
program consists in the execution of this term. 

— constant symbols for programs in an input-output library. 

The execution consists in constantly updating a closure (T, E) and the 
stack. T is the address of the current subterm (which is not closed, in 
general) : it is, therefore, an instruction pointer which runs along the 
term to be performed ; E is the current environment. 
At the beginning, T is the address of the first term to to be performed. 
Since it is a closed term, E is the null pointer (which points to the 
empty environment). 

At each moment, there are three possibilities according to the term 
pointed by T : it may be an application (t)u, an abstraction Xx t or a 
variable. 
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— Execution of (t)u. 

We push the closure {ku, E) on the top of the stack and we go on 
by performing t : thus T points now to t and E does not change. 

— Execution of Xx\ . . . Xx n t where t does not begin with a A ; thus, 
T points to Xx\. 

A new environment (e, £1, . . . , £ n ) is created : e is the address of E, 
£1, . . . ,£ n are "popped" : we take the n top entries off the stack. 
We put in E the address of this new environment in the heap, and 
we go on by performing t : thus T points now to t. 

— Execution of x (a A-calculus variable). 

We fetch as follows the value of the variable x in the environ- 
ment E : indeed, it is a bound occurrence of x in the initial term 
to. Thus, it was replaced by an ordered pair of integers <u,k>. If 
v = 0, the value we need is the fc-th closure of the environment E. 
If v > 1, let Ei be the environment which has its address in E, Ei 
the one which has its address in Ei, etc. Then, the value of x is 
the fc-th closure of E v . This value is an ordered pair (T", E') which 
we put in (T, E). 

Remark. 

The intuitive meaning of these rules of execution is to consider the symbols 
Ax, (,x of A-calculus as elementary instructions : 

• "Ax" is : "pop" in x and increment the instruction pointer. 

• "(" is : "push" the address of the corresponding ")" and increment the 
instruction pointer. 

• "x" is : go to the address which is contained in x. 

It remains to explain how we compute the integers v, k for each occur- 
rence of a variable x, i.e. how we "compile" a closed A-term t. More 
generally, we compute v for an occurrence of x in an arbitrary A-term t, 
and k when it is a bound occurrence in t. This is done by induction on 
the length of t. 

If t = x, we set v = 0. If t = uv , the occurrence of x we consider is in 
u (resp. v). We compute v, and possibly k, in u (resp. v). 
Let now t = Xx\ . . . Xx n u with n > 0, u being a term which does 
not begin with a A. If the occurrence of x we consider is free in t, we 
compute v in t by computing v in u, then adding 1. If this occurrence 
of x is bound in u, we compute v and k in u. Finally, if this occurrence 
is free in u and bound in t, then we have x = X{. We compute v in u, 
and we set k = i. 
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2. Formal definitions and correction proof 

Compiled terms or A^-terms (this notion is a variant of the de Bruijn 
notation) are defined as follows : 

• A constant a or an ordered pair <v, k> (k > 1) of integers is a 
As-term (atomic term). 

• If t, u are A^-terms, then so is (t)u. 

• If t is a A^-term which does not begin with A* and if n > 1, then 
\ n t is a As-term. 

Let us consider, in a A^-term t, an occurrence of a constant a or of 
<u,k> (ordered pair of integers). We define, in an obvious way, the 
depth of this occurrence, which is the number of A n symbols above it. 
The definition is done by induction on the length of t : 
If there is no A* symbol in t, the depth is 0. 

If t = (u)v, the occurrence we consider is either in u or in v. We compute 
its depth in this subterm and do not change it. 

If t = X n u, we compute the depth of this occurrence in the subterm u 
and we add 1 to it. 

An occurrence of <v, k> in t is said to be free (resp. bound) if its depth 
in t is < v (resp. > v). 

Of course, each occurrence of a constant a is free. Thus, we could write 
constants as ordered pairs <co,k>. 

Consider a bounded occurrence of <is, k> in a A^-term t ; then there 
is a unique A ra in t which bounds this occurrence. If k > n, we say that 
this occurrence of <v, k> in t is dummy. A A^-term without dummy 
bound occurrences will be called good. We can easily transform a A^- 
term into a good one : simply substitute each dummy occurrence with 
a (unique) new constant symbol d. 

Alpha-equivalence 

Let t be a closed A-term, with constants. We define, by induction on t, 
its "compiled" form, which is a A^-term denoted by B(t) : 
If a is a constant, then B(a) = a ; if t = uv, then B(t) = B(u)B(v). 
If t = Xxi . . . Xx n u where u does not begin with A, consider the A^- 
term : B(n[ai/xi, . . . , a n /x n ]), where a\, . . . ,a n are new constants. 
We replace in it each occurrence of a« with the ordered pair <u,i>, 
where v is the depth of this occurrence in B(u[ai/ 'x\, . . . , a n /x n ]). We 
get in this way a A^-term U and we set : B(t) = X n U. 

THEOREM 1. A Xs-term r is good iff there exists a X-term t such 
that t = B[t]. 
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We omit the easy proof. 

□ 

The compiled form B[t] of a A-term t is a variant of the de Bruijn 
notation for t. Its main property, expressed by theorem 2, is that it 
depends only on a-equivalence class of t. This property is not used in 
the following, but the simplicity of the proof below convinced me to 
give it here. 

THEOREM 2. Two closed X-terms t,t' are a-equivalent (which we 
denote by t ~ a t! ) if and only if B(t) = B(t'). 

The proof is done by induction on t. The result is clear if t = a or 
t = uv. So, we assume now that t = Xxi . . . Xx n u where u does not 
begin with A. If t ~ a t' or if B(t) = B(t'), then t' = Xx[ . . . Xx' n v! where 
u' does not begin with A. Let a±,. . . ,a n be new constants ; then, by 
definition of a-equivalence [3], we have : 
t~ a t' u[a 1 /x 1 , a n /x n ] ~ Q u'[ai/x[, a n /x' n \. 
By induction hypothesis, this is equivalent to : 
B(u[ai/xi, . . .,a n /x n ]) = B(u'[ai/x[, . . .,a n /x' n ]). 
If B(u[ai/xi, . . . , a n /x n ]) = B(u'[ai/a; / 1 , . . . , a n /x' n ]), we obviously have 
B(t) = B(t'). But conversely, we get B(u[ai/xi, . . . , a n /x n \) from B(t), 
by removing the initial X n and replacing <v,i> with ai for every oc- 
currence of <v, i> the depth of which is precisely equal to v. 
Therefore, we have B(u[ai/xi, . . . ,a n /x n ]) = B(u'[ai/x'i, . . . , a n /x' n ]) 
& B(t) = B(t') and finally t~ a t' O B(t) = B(t'). 

□ 

Weak head reduction 

Consider a A^-term of the form (X n t)ui . . . u p with p > n. Then, we can 
carry out a weak head reduction step : we get the A#-term t'u n+ i . . . u p 
(or t', if n = p) ; the term t' is obtained by replacing in t each free 
occurrence of <v,i> with : 

<v — l,i> if v is strictly greater than the depth of this occurrence ; 

Ui (resp. d) if v is equal to the depth of this occurrence and i < n 
(resp. i > n) ; d is a fixed constant, which replaces dummy bound 
occurrences in X n t. 

We write t >- u if u is obtained from t by a finite (possibly null) number 
of weak head reduction steps. 

It is clear that the weak head reduction of a A-term t corresponds to 
the weak head reduction of its compiled form B(t). 
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Closures, environments and stacks 

We now define recursively closures and environments : 

0 is an environment (the empty environment) ; if e is an environ- 
ment and (f) n are closures (n > 0), then the finite sequence 
(e, (pi, . . . , 4> n ) is an environment. 

A closure is an ordered pair (t,e) composed with a A^-term t and an 
environment e. 

A stack is a finite sequence tt — • • • , 4*n) of closures. 

We denote by 4>.ir the stack (<f>, 4>i, ■ ■ ■ , 4>n) obtained by "pushing" the 

closure <j> on the top of the stack tt. 

Execution rules 

A state of the machine is a triple (t,e,ir) where t is a A^-term, e an 
environment and tt a stack. We now give the execution rules, by which 
we pass from a state (t, e, tt) to the next one (tf, e' , tt') : 

• If t = (u)v, then t' = u, e' = e and tt' = (v, e).TT. 

• If t = X n u, the length of the stack tt must be > n, otherwise the 
machine stops. Thus, we have tt = <\>\ . . .<p n -^', which defines tt'. We 
set t' = u and e' = (e, <f>i, . . . , (p n ). 

• If t = <v, k> : let eo = e and let e%+\ be the environment which is 
the first element of e^, for i = 0, 1, ... 

If ej = 0 for an i < u, then the machine stops. 
Otherwise, we have e u = (e u+ i, (t±,£i), . . . , (t p , e p )). 
If k < p, we set t' = tk, e' = £k and tt' = tt. 
If k > p, then the machine stops. 

The value of a closure 

Given any closure <p = (i,e), we define a closed A#-term which is 
denoted by (f> or t[e] ; it is defined by induction on the environment 
e as follows : 

If e = 0, we obtain t[0] by replacing in t each free occurrence of 
<v,i> (i.e. its depth is < v) with the constant d. 

If e = (e, 4>\, . . . , (f> n ), we set t[e] = u[e] where u is the A^-term we 
obtain by replacing in t each free occurrence of <v,i> with : 
<v — 1, i> if i/ is strictly greater than the depth of this occurrence ; 
4>i (resp. d) if z/ is equal to the depth of this occurrence and i < n (resp. 

1 > n) ; d is a fixed constant. 

Remark. We observe that i[e] is a closed As-term, which is obtained by 
replacing in t the free occurrences of <v,i> with suitable A^-terms. These 
closed Ae-terms are recursively provided by the environment e ; the constant 
d (for "dummy" ) is used as a "wild card" , when the environment e does not 
provide anything. 
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THEOREM 3. Let (t,e,ir), (t',e',ir') 6e too consecutive states of the 
machine, with tt = (fa, . . . , <^ m ) and 7r' = (fa, . . . , <^/). 
T/ien, we have : t[e]fa . . . (f> m >- t'[e']fa . . . fa m ,. 

Recall that the symbol >- denotes the weak head reduction. We shall 
use the notation t[e]w for t[e]fa . . . 4> m when tt is the stack (fa, . . . , 4> m ). 
There are three possible cases for t : 

• t = (u)v : we have t[e] = u[e]v[e], t'[e'] = u[e] (since e' = e) and 
tt' = (v,e).ir. Therefore t[e]ir = t'[e']it'. 

• t = <v, k> : let eo = e and e,-+i the environment which is the first 
element of ej, if ej / 0. Then, by the reduction rules of the machine, 
we have (t', e') = ipk where ipk is the k-th closure of the environment 
e v = (e v +i,fa, . . . , tp p ) (and k is necessarily < p). Now, by definition of 
t[e], we have t[e] = ipk- Therefore, t[e]lt = = t'[e']jr', since tt' = -it. 

• t = X n u : then we have n <m and 

t' = u, e' = (e, fa, . . . , fa), tt 1 = (fa+i, ■ ■ ■ , 4> m ). We must show that 
( \ n u) [e] fa . . . 4> n y u[(e, fa , . . . , fa)} . 

By the very definitions of the value of a closure and of the weak head 
reduction, we have u[(e, fa, . . . , fa)] = v[e], where v is obtained by 
one step of weak head reduction in (X n u)fa . . . fa. We denote this by 
(X n u)fa . . . fa >~i v, and we now show that (X n u)[e]fa . . . fa >-\ v[e] 
(which will give the result). 

We obtain v[e] by replacing, in u, the free occurrences of <u,i> 

i) with fa (or d if i > n) if v = the depth of this occurrence ; 

ii) with <v — 1, i> if v > the depth of this occurrence ; and after that, 
we replace this occurrence of <v — l,i> with the closed term given by 
the environment e. 

Now, we obtain (A n it)[e] (which is closed) by the substitution (ii) on 
the free occurrences of <v,i> in u such that v > the depth of this 
occurrence in u. Indeed, they are exactly the free occurrences in \ n u. 
Then, one step of weak head reduction on (A n n)[e]^i . . . fa performs 
the substitution (i) on the occurrences of <u, i> in u such that v = the 
depth in u of this occurrence. This shows that this step of reduction 
gives v[e\. 

□ 

This theorem shows that the machine which has been described above 
computes correctly in the following sense : if t >- at\ . . . tk, where t is a 
closed A-term and a is a constant, then the execution of B(t), from an 
empty environment and an empty stack, will end up in aB(t\) . . . B(ifc). 
In particular, if t >~ a, then the execution of B(t) will end up in a. 
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3. Control instruction and continuations 

We now extend this machine with a call-by-name control instruction, 
and with continuations. There are two advantages : first, an obvious 
utility for programming ; second, in the frame of realisability theory 
(see the introduction), this allows the typing of programs in classical 
logic and no longer only in intuitionistic logic. Indeed, the type of the 
instruction call/cc is Peirce's law ((A — > B) — ► A) — > A (see [2]). 
As we did before, we give first an informal description of the machine, 
then mathematical definitions. 

Description of the machine 

We describe only the changes. Terms are the same but there is one 
more constant, which is denoted by cc. There are still three memory 
areas : the stack and the term area, which are the same as before, and 
the heap which contains objects of the following kinds : 

— environment : same definition. 

— closure : it is, either an ordered pair (&t, e) built with the address 
of a term (in the term area) and the address of an environment (in 
the heap) ; or the address &7 of a continuation. 

— continuation : it is a sequence 7 = (£1, • • • , £n) of closures (the 
same as a stack) . 

Execution of a term 

The execution consists in constantly updating the current closure S and 
the stack. There are now two possible forms for the current closure : 
(&r, e) (where r is a term) or &7 (where 7 is a continuation). 
Consider the first case : S = (&r, e) . There are now four possibilities 
for the term r : an application (t)u, an abstraction Xxt, a variable x or 
the constant cc. Nothing is changed during execution in the first two 
cases. 

— Execution of x (A-calculus variable). 

As before, we fetch the value of the variable x in the environment 
e, which gives a closure £ which becomes the current closure S. 
The stack does not change. 

— Execution of cc. 

We pop a closure £ which becomes the current closure E. We save 
the current stack in a continuation 7 and we push the address of 7 
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(this address is a closure) on the top of the stack. 

Therefore, the stack, which was of the form (£, £1, . . . , £„), has 

become (&7, £1, . . . , £„) with 7 = (£1, . . . , £„). 

Consider now the second case, when the current closure S is of the form 
&7. Then, the execution consists in popping a closure £, which becomes 
the current closure and in replacing the current stack with 7. 

Formal definitions 

A^-terms are defined as before, with a distinguished constant, which is 
denoted by cc. 

We define recursively the closures, the environments and the stacks 
(which are now also called continuations) : 

0 is an environnement (the empty environnement) ; if e is an envi- 
ronment and fa, . . . , (j> n are closures (n > 0), then the finite sequence 
(e, <pi, . . . , 4> n ) is an environnement. 

A closure is either a stack, or an ordered pair (t, e) composed with a 
A^-term t and an environment e. 

A stack (or continuation) is a finite sequence 7 = (0i, . . . , <p n ) of clo- 
sures. We denote by ^.7 the stack (0, </>i, . . . , </> n ) which is obtained by 
"pushing" the closure 4> on the top of the stack 7. 

Execution rules 

A state of the machine is an ordered pair (0, tt) where eft is a closure 
and 7r is a stack. We give now the execution rules, by which we pass 
from a state (0, tt) to the next one (<fi', tt') : 

• If 0 is a continuation (i.e. a stack), then 0' is the closure which is on 
the top of the stack tt (if tt is empty, the machine stops) and it' = <p. 

• Else, we have 4> = (t, e) and there are four possibilities for the 
A^-term t : 

• If t = (u)v, then eft' = (u, e) and tt' = (v, e).ir. 

• If t = X n u, then the length of the stack tt must be > n, otherwise 
the machine stops. Thus, we have tt = <p\ . . . <j) n .ir' , which defines tt' . 
We set (j)' = (u, e') with e' = (e, (f>i, . . . , (ft n ). 

• If t = <u, k> : let eo = e and let ej + i be the environment which 
is the first element of e«, for i = 0, 1, . . . If ej = 0 for an i < v, then 
the machine stops. Else, we have e v = (e u +i, (f)±, ... , 4> p ). 

If k < p, we set <p' = (pk and tt' = tt. If k > p, the machine stops. 

• If i = cc, then <// is the closure which is on the top of the stack tt 
(if tt is empty, the machine stops). Thus, we have tt = 4>'.p where p is a 
stack. Therefore, p is also a closure, which we denote by <p p . Then, we 
set tt' = 4>p.p. 
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